Channel scheduling in optical routers

ABSTRACT

An optical switch network ( 4 ) includes optical routers ( 10 ), which route information in optical fibers ( 12 ). Each fiber carries a plurality of data channels ( 20 ), collectively a data channel group ( 14 ), and a control channel ( 16 ). Data is carried on the data channels in data bursts and control information is carried on the control channel ( 18 ) in burst header packets. A burst header packet includes routing information for an associated data burst ( 28 ) and precedes its associated data burst. Parallel scheduling at multiple delays may be used for faster scheduling. In one embodiment, unscheduled times and gaps may be processed in a unified memory for more efficient operation.

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of the filing date of copending provisional application U.S. Ser. No. 60/257,487, filed Dec. 22, 2000, ;entitled “Channel Scheduling in Optical Routers” to Xiong.

[0002] This application is related to U.S. Ser. No. 09/569,488 filed May 11, 2000, entitled, “All-Optical Networking Optical Fiber Line Delay Buffering Apparatus and Method”, which claims the benefit of U.S. Ser. No. 60/163,217 filed Nov. 2, 1999, entitled, “All-Optical Networking Optical Fiber Line Delay Buffering Apparatus and Method” and is hereby fully incorporated by reference. This application is also related to U.S. Ser. No. 09/409,573 filed Sep. 30, 1999, entitled, “Control Architecture in Optical Burst-Switched Networks” and is hereby incorporated by reference. This application is further related to U.S. Ser. No. 09/689,584, filed Oct. 12, 2000, entitled “Hardware Implementation of Channel Scheduling Algorithms For Optical Routers With FDL Buffers,” which is also incorporated by reference herein.

[0003] This application is further related to U.S. Ser. No. ______ (Attorney Docket 135779), filed concurrently herewith, entitled “Unified Associative Memory of Data Channel Schedulers in an Optical Router” to Zheng et al and U.S. Ser. No. ______ (Attorney Docket 135818), filed concurrently herewith, entitled “Optical Burst Scheduling Using Partitioned Channel Groups” to Zheng et al.

STATEMENT OF FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

[0004] Not Applicable

BACKGROUND OF THE INVENTION

[0005] 1. TECHNICAL FIELD

[0006] This invention relates in general to telecommunications and, more particularly, to a method and apparatus for optical switching.

[0007] 2. DESCRIPTION OF THE RELATED ART

[0008] Data traffic over networks, particularly the Internet, has increased dramatically recently, and will continue as the user increase and new services requiring more bandwidth are introduced. The increase in Internet traffic requires a network with high capacity routers capable of routing data packets of variable length. One option is the use of optical networks.

[0009] The emergence of dense-wavelength division multiplexing (DWDM) technology has improved the bandwidth problem by increasing the capacity of an optical fiber. However, the increased capacity creates a serious mismatch with current electronic switching technologies that are capable of switching data rates up to a few gigabits per second, as opposed to the multiple terabit per second capability of DWDM. While emerging ATM switches and IP routers can be used to switch data using the individual channels within a fiber, typically at a few hundred gigabits per second, this approach implies that tens or hundreds of switch interfaces must be used to terminate a single DWDM fiber with a large number of channels. This could lead to a significant loss of statistical multiplexing efficiency when the parallel channels are used simply as a collection of independent links, rather than as a shared resource.

[0010] Different approaches advocating the use of optical technology in place of electronics in switching systems have been proposed; however, the limitations of optical component technology has largely limited optical switching to facility management/control applications. One approach, called optical burst-switched networking, attempts to make the best use of optical and electronic switching technologies. The electronics provides dynamic control of system resources by assigning individual user data bursts to channels of a DWDM fiber, while optical technology is used to switch the user data channels entirely in the optical domain.

[0011] Previous optical networks designed to directly handle end-to-end user data channels have been disappointing.

[0012] Therefore, a need has arisen for a method and apparatus for providing an optical burst-switched network.

BRIEF SUMMARY OF THE INVENTION

[0013] The present invention provides circuitry for scheduling data bursts in an optical burst-switched router. An optical switch routes optical information from an incoming optical transmission medium to one of a plurality of outgoing optical transmission media. A delay buffer coupled to the optical switch provides n different delays for delaying information between the incoming transmission medium and the outgoing transmission media. A scheduling circuit is associated with each outgoing medium; the scheduling circuits each comprise n+1 associative processors. Each associative processor includes circuitry for (1) storing scheduling information for the associated outgoing optical transmission medium relative to a respective one of the n delays and for a zero delay and (2) identifying available time periods relative to the respective delays in which a data burst may be scheduled.

[0014] The present invention provides a fast and efficient method for scheduling bursts in an optical burst-switched router.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

[0015] For a more complete understanding of the present invention, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, in which:

[0016]FIG. 1a is a block diagram of an optical network;

[0017]FIG. 1b is a block diagram of a core optical router;

[0018]FIG. 2 illustrates a data flow of the scheduling process;

[0019]FIG. 3 illustrates a block diagram of a scheduler;

[0020]FIGS. 4a and 4 b illustrate timing diagrams of the arrival of a burst header packet relative to a data burst;

[0021]FIG. 5 illustrates a block diagram of a DCS module;

[0022]FIG. 6 illustrates a block diagram of the associative memory of P_(M);

[0023]FIG. 7 illustrates a block diagram of the associative memory of P_(G);

[0024]FIG. 8 illustrates a flow chart of a LAUC-VF scheduling method;

[0025]FIG. 9 illustrates a block diagram of a CCS module;

[0026]FIG. 10 illustrates a block diagram of the associative memory of PT;

[0027]FIG. 11 illustrates a flow chart of a constrained earliest time method of scheduling the control channel;

[0028]FIG. 12 illustrates a block diagram of the path & channel selector;

[0029]FIG. 13 illustrates a example of a blocked output channel through the recirculation buffer;

[0030]FIG. 14 illustrates a memory configuration for a memory of the BHP transmission module;

[0031]FIG. 15 illustrates a block diagram of an optical router architecture using passive FDL loops;

[0032]FIG. 16 illustrates an example of a path & channel scheduler with multiple P_(M) and P_(G) pairs;

[0033]FIGS. 17a and 17 b illustrate timing diagram of outbound data channels;

[0034]FIG. 18 illustrates clock signals for CLK_(f) and CLK_(s);

[0035]FIGS. 19a and 19 b illustrate alternative hardware modifications for slotted operation of a router;

[0036]FIG. 20 illustrates a block diagram of an associative processor P_(M);

[0037]FIG. 21 illustrates a block diagram of an associative processor P_(G);

[0038]FIG. 22 illustrates a block diagram of an associative processor P_(MG);

[0039]FIG. 23 illustrates a block diagram of an associative processor P*_(MG);

[0040]FIG. 24 illustrates a block diagram of an embodiment using multiple associative processors for fast scheduling;

[0041]FIG. 25 illustrates a block diagram of a processor P_(M-ext) for use with multiple channel groups; and

[0042]FIG. 26 illustrates a block diagram of a processor P_(G-ext) for use with multiple channel groups.

DETAILED DESCRIPTION OF THE INVENTION

[0043] The present invention is best understood in relation to FIGS. 1-26 of the drawings, like numerals being used for like elements of the various drawings.

[0044]FIG. 1a illustrates a general block diagram of an optical burst switched network 4. The optical burst switched (OBS) network 4 includes multiple electronic ingress edge routers 6 and multiple egress edge routers 8. The ingress edge routers 6 and egress edge routers 8 are coupled to multiple core optical routers 10. The connections between ingress edge routers 6, egress edge routers 8 and core routers 10 are made using optical links 12. Each optical fiber can carry multiple channels of optical data.

[0045] In operation, a data burst (or simply “burst”) of optical data is the basic data block to be transferred through the network 4. Ingress edge routers 6 and egress edge routers 8 are responsible for burst assembly and disassembly functions, and serve as legacy interfaces between the optical burst switched network 4 and conventional electronic routers.

[0046] Within the optical burst switched network 4, the basic data block to be transferred is a burst, which is a collection of packets having some common attributes. A burst consists of a burst payload (called “data burst”) and a burst header (called “burst header packet” or BHP). An intrinsic feature of the optical burst switched network is that a data burst and its BHP are transmitted on different channels and switched in optical and electronic domains, respectively, at each network node. The BHP is sent ahead of its associated data burst with an offset time τ (≧0). Its initial value, τ₀, is set by the (electronic) ingress edge router 8.

[0047] In this invention, a “channel” is defined as a certain unidirectional transmission capacity (in bits per second) between two adjacent routers. A channel may consist of one wavelength or a portion of a wavelength (e.g., when time-division multiplexing is used). Channels carrying data bursts are called “data channels”, and channels carrying BHPs and other control packets are called “control channels”. A “channel group” is a set of channels with a common type and node adjacency. A link is defined as a total transmission capacity between two routers, which usually consists of a “data channel group” (DCG) and a “control channel group” (CCG) in each direction.

[0048]FIG. 1b illustrates a block diagram of a core optical router 10. The incoming DCG 14 is separated from the CCG 16 for each fiber 12 by demultiplexer 18. Each DCG 14 is delayed by a fiber delay line (FDL) 19. The delayed DCG is separated into channels 20 by demultiplexer 22. Each channel 20 is input to a respective input node on a non-blocking spatial switch 24. Additional input and output nodes of spatial switch 24 are coupled to a recirculation buffer (RB) 26. Recirculation buffer 26 is controlled by a recirculation switch controller 28. Spatial switch 24 is controlled by a spatial switch controller 30.

[0049] CCGs 14 are coupled to a switch control unit (SCU) 32. SCU includes an optical/electronic transceiver 34 for each CCG 14. The optical/electronic transceiver 34 receives the optical CCG control information and converts the optical information into electronic signals. The electronic CCG information is received by a packet processor 36, which passes information to a forwarder 38. The forwarder for each CCG is coupled to a switch 40. The output nodes of switch 40 are coupled to respective schedulers 42. Schedulers 42 are coupled to a Path & Channel Selector 44 and to respective BHP transmit modules 46. The BHP transmit modules 46 are coupled to electronic/optical transceivers 48. The electronic/optical transceivers produce the output CCG 52 to be combined with the respective output DCG 54 information by multiplexer 50. Path & channel selector 44 is also coupled to RB switch controller 28 and spatial switch controller 30.

[0050] The embodiment shown in FIG. 1b has N input DCG-CCG pairs and N output DCG-CCG pairs 52, where each DCG has K channels and each CCG has only one channel (k=1). A DCG-CCG pair 52 is carried in one fiber. In general, the optical router could be asymmetric, the number of channels k of a CCG 16 could be larger than one, and a DCG-CCG pair 52 could be carried in more than one fiber 12. In the illustrated embodiment, there is one buffer channel group (BCG) 56 with R buffer channels. In general, there could be more than one BCG 56. The optical switching matrix (OSM) consists of a (NK+R)×(NK+R) spatial switch and a R×R switch with WDM (wavelength division multiplexing) FDL buffer serving as recirculation buffer (RB) 26 to resolve data burst contentions on outgoing data channels. The spatial switch is a strictly non-blocking switch, meaning that an arriving data burst on an incoming data channel can be switched to any idle outgoing data channel. The delay Δ introduced by the input FDL 19 should be sufficiently long such that the SCU 32 has enough time to -process a BHP before its associated data burst enters the spatial switch.

[0051] The R×R RB switch is a broadcast-and-select type switch of the type described in P. Gambini, et al., “Transparent Optical Packet Switching Network Architecture and Demonstrators in the KEOPS Project”, IEEE J. Selected Areas in Communications, vol. 16, no. 7, pp. 1245-1259, September 1998. It is assumed that the R×R RB switch has B FDLs with the ith FDL introducing Q₁ delay time, 1≦i≦B. It is further assumed without loss of generality that Q₁<Q₂< . . . <Q_(B) and Q₀ =0, meaning no FDL buffer is used. Note that the FDL buffer is shared by all N input DCGs and each FDL contains R channels. A data burst entering the RB switch on any incoming channel can be delayed by one of B delay times provided. The recirculation buffer in FIG. 1b can be degenerated to passive FDL loops by removing the function of RB switch, as shown in FIG. 15, wherein different buffer channels may have different delays.

[0052] The SCU is partially based on an electronic router. In FIG. 1b, the SCU has N input control channels and N output control channels. The SCU mainly consists of packet processors (PPs) 36, forwarders 38, a switching fabric 40, schedulers 42, BHP transmission modules 46, a path & channel selector 44, a spatial switch controller 30, and a RB switch controller 28. The packet processor 36, the forwarders 38, and the switching fabric 40 can be found in electronic routers. The other components, especially the scheduler, are new to optical routers. The design of the SCU uses the distributed control as much as possible, except the control to the access of shared FDL buffer which is centralized.

[0053] The packet processor performs layer 1 and layer 2 decapsulation functions and attaches a time-stamp to each arriving BHP, which records the arrival time of the associated data burst to the OSM. The time-stamp is the sum of the BHP arrival time, the burst offset-time τ carried by the BHP and the delay A introduced by input FDL 19. The forwarder mainly performs the forwarding table lookup to decide which outgoing CCG 52 to forward the BHP. The associated data burst will be switched to the corresponding DCG 54. The forwarding can be done in a connectionless or connection-oriented manner.

[0054] There is one scheduler for each DCG-CCG pair 52. The scheduler 42 schedules the switching of the data burst on a data channel of the outgoing DCG 54 based on the information carried by the BHP. If a free data channel is found, the scheduler 42 will then schedule the transmission of the BHP on the outgoing control channel, trying to “resynchronize” the BHP and its associated data burst by keeping the offset time τ (≧0) as close as possible to τ₀. After both the data burst and BHP are successfully scheduled, the scheduler 42 will send the configuration information to the spatial switch controller 30 if it is not necessary to provide a delay through the recirculation buffer 26, otherwise it will also send the configuration information to the RB switch controller 28.

[0055] The data flow of scheduling decision process is shown in FIG. 2. In decision block 60, the scheduler 42 determines whether or not there is enough time to schedule an incoming data burst. If so, the scheduler determines whether the data burst can be scheduled, i.e., whether there is an unoccupied space in the specified output DCG 54 for the data burst. In order to schedule the data burst, there must be an available space to accommodate the data burst in the specified output DCG. This space may start within a time window beginning at the point of arrival of the data burst at the spatial switch 24 extending to the maximum delay which can be provided by the recirculation buffer 26. If the data burst can be scheduled, then the scheduler 42 must determine whether there is a space available in the output CCG 52 for the BHP in decision block 64.

[0056] If any of the decisions in decision blocks 60, 62 or 64 are negative, the data burst and BHP are dropped in block 65. If all of the decisions in decision blocks 60, 62 and 64 are positive, the scheduler sends the scheduling information to the path and channel selector 44. The configuration information from scheduler to path & channel selector includes incoming DCG identifier, incoming data channel identifier, outgoing DCG identifier, outgoing data channel identifier, data burst arrival time to the spatial switch, data burst duration, FDL identifier i (Q_(i) delay time is requested, 0≦i≦B).

[0057] If the FDL identifier is 0, meaning no FDL buffer is required, the path & channel selector 44 will simply forward the configuration information to the spatial switch controller 30. Otherwise, the path & channel selector 44 searches for an idle incoming buffer channel to the RB switch 26 in decision block 68. If found, the path and channel selector 44 searches for an idle outgoing buffer channel from the RB switch 26 to carry the data burst reentering the spatial switch after the specified delay inside the RB switch 26 in decision block 70. It is assumed that once the data burst enters the RB switch, it can be delayed for any discrete time from the set {Q₁, Q₂, . . . , Q_(B) }. If this is not the case, the path & channel selector 44 will have to take the RB switch architecture into account. If both idle channels to and from the RB switch 26 are found, the path & channel selector 44 will send configuration information to the spatial switch controller 30 and the RB switch controller 28 and send an ACK (acknowledgement) back to the 42 scheduler. Otherwise, it will send a NACK (negative acknowledgement) back to the scheduler 42 and the BHP and data burst will be discarded in block 65.

[0058] Configuration information from the path & channel selector 44 to the spatial switch controller 30 includes incoming DCG identifier, incoming data channel identifier, outgoing DCG identifier, outgoing data channel identifier, data burst arrival time to the spatial switch, data burst duration, FDL identifier i (Q_(i) delay time is requested, 0≦i≦B). If i>0, the information also includes the incoming BCG identifier (to the RB switch), incoming buffer channel identifier (to the RB switch), outgoing BCG identifier (from the RB switch), and outgoing buffer channel identifier (from the RB switch).

[0059] Configuration information from path & channel selector to RB switch controller includes an incoming BCG identifier (to the RB switch), incoming buffer channel identifier (to the RB switch), outgoing BCG identifier (from the RB switch), outgoing buffer channel identifier (from the RB switch), data burst arrival time to the RB switch, data burst duration, FDL identifier i (Q_(i) delay time is requested, 1≦i≦B).

[0060] The spatial switch controller 30 and the RB switch controller 28 will perform the mapping from the configuration information received to physical components that involved in setting up the internal path(s), and configure the switches just-in-time to let the data burst fly-through the optical router 10. When the FDL identifier is larger than 0, the spatial switch controller will set up two internal paths in the spatial switch, one from the incoming data channel to the incoming recirculation buffer channel when the data burst arrives to the spatial switch, another from the outgoing buffer channel to the outgoing data channel when the data burst reenters the spatial switch. Upon receiving the ACK from the path & channel selector 44, the scheduler 42 will update the state information of selected data and control channels, and is ready to process a new BHP.

[0061] Finally, the BHP transmission module arranges the transmission of BHPs at times specified by the scheduler.

[0062] The above is the general description on how the data burst is scheduled in the optical router. Recirculating data bursts through the R×R recirculation buffer switch more than once could be easily extended from the design principles described below if so desired.

[0063]FIG. 3 illustrates a block diagram of a scheduler 42. The scheduler 42 includes a scheduling queue 80, a BHP processor 82, a data channel scheduling (DCS) module 84, and a control channel scheduling (CCS) module 86. Each scheduler needs only to keep track of the busy/idle periods of its associated outgoing DCG 54 and outgoing CCG 52.

[0064] BHPs arriving from the electronic switch are first stored in the scheduling queue 80. For basic operations, all that is required is one scheduling queue 80, however, virtual scheduling queues 80 may be maintained for different service classes. Each queue 80 could be served according to the arrival order of BHPs or according to the actual arrival order of their associated data bursts. The BHP processor 82 coordinates the data and control channel scheduling process and sends the configuration to the path & channel selector 44. It could trigger the DCS module 84 and the CCS module 82 in sequence or in parallel, depending on how the DCS and CCS modules 84 and 82 are implemented.

[0065] In the case of serial scheduling, the BHP processor 82 first triggers the DCS module 84 for scheduling the data burst (DB) on a data channel in a desired output DCS 54. After determining when the data burst will be sent out, the BHP processor then triggers the CCS module 86 for scheduling the BHP on an associated control channel.

[0066] In the case of parallel scheduling, the BHP processor 82 triggers the DCS module 84 and CCS module 86 simultaneously. Since the CCS module 86 does not know when the data burst will be sent out, it schedules the BHP for all possible departure times of the data burst or its subset. There are in total B+1 possible departure times. Based on the actual data burst departure time reported from the DCS module 84, the BHP processor 86 will pick the right time to send out the BHP.

[0067] Slotted transmission is used in data and control channels between edge and core and between core nodes in the OBS network. A slot is a fixed-length time period. Let T_(s) be the duration (e.g., in μs) of a time slot in data channels and T_(f) be the duration of a time slot in control channels. T_(s)·r_(d) Kbits of information can be sent during a slot if the data channel speed is rd gigabits per second. Similarly, T_(f)·r_(c) Kbits of information can be sent during a slot if the control channel speed is r_(c) gigabits per second. Two scenarios are considered, (1) r_(c)=r_(d) and (2) r_(c)≠r_(d). In the latter case, a typical example is that r_(c)=r_(d)/4 (e.g., OC-48 is used in control channels and OC-192 is used in data channels).

[0068] Without loss of generality, it is assumed that T_(f) is equal to multiples of T_(s). Two examples are depicted in FIGS. 4a and 4 b (see also, FIG. 18), which illustrates the timestamp and burst offset-time in a slotted transmission system for the cases where T_(f)=T_(s) and T_(f)=4T_(s), with the initial offset time τ₀=8T_(s). To simplify the description, we use timeframe to designate time slot in control channels. It is further assumed without loss of generality that, (1) data bursts are variable length, in multiple of slots, which can only arrive at slot boundaries, and (2) BHPs are also variable length, in, for instance, multiple of bytes. Fixed-length data bursts and BHPs are just special cases. In slotted transmission, there is some overhead in each slot for various purposes like synchronization and error detection. Suppose the frame payload on control channels is P_(f) bytes, which is less than (T_(f)·r_(c))·1000/8 bytes, the total amount of information can be transmitted in a time frame.

[0069] The OSM is configured periodically. For slotted transmission on data channels, a typical example of the configuration period is one slot, although the configuration period could also be a multiple of slots. Here it is assumed that the OSM is configured every slot. The length of a FDL Q_(i) needs also to be a multiple of slots, 1≦i≦B. Due to the slotted transmission and switching, it is suggested to use the time slot as a basic time unit in the SCU for the purpose of data channel scheduling, control channel scheduling and buffer channel scheduling, as well as synchronization between BHPs and their associated data bursts. This will simplify the design of various schedulers.

[0070] The following integer variables are used in connection with FIGS. 4a, 4 b and 5:

[0071] t_(BHP): the beginning of a time frame during which the BHP enters the SCU;

[0072] t_(DB): the arrival time of a data burst (DB) to the optical switching matrix (OSM);

[0073] l_(DB): the duration/length of a DB in slots;

[0074] Δ: delay (in slots) introduced by input FDL

[0075] τ: burst offset-time (in slots).

[0076] Each arriving BHP to the SCU is time-stamped at the transceiver interface, right after O/E conversion, recording the beginning of the time frame during which the BHP enters the SCU. For the BHPs received by the SCU in the same time frame, they will have the same timestamp t_(BHp). For scheduling purpose, the most important variable is t_(DB), the DB arrival time to the OSM. Suppose a b-bit slot counter is used in the SCU to keep track of time, t_(DB) can be calculated as follows.

t _(DB)=(t _(BHP) ·T _(f)+Δ+τ)mod2b.  (1)

[0077] Timestamp t_(DB) will be carried by the BHP within the SCU 32. Note that the burst offset-time τ is also counted starting from the beginning of the time frame that the BHP arrives as shown in FIGS. 4a-b, where in FIG. 4a, t_(BHp =)9 and τ=6 slots, and in FIG. 4b, t_(BHP)=2 and τ=7 slots. Suppose Δ=100 slots, we have t_(DB)=115, meaning that the DB will arrive at slot boundary 115. In FIGS. 4a-b, 1≦τ≦τ₀=8. It is assumed without loss of generality that the switching latency of the spatial switch in FIG. 1b is negligible. So the data burst arrival time t_(DB) to the spatial switch 24 is also its departure time if no FDL buffer is used. Note that even if the switching latency is not negligible, t_(DB) can still be used as the data burst departure time in channel scheduling as the switching latency is compensated at router output ports where data and control channels are resynchronized.

[0078]FIG. 5 illustrates a block diagram of a DCS module 84. In this embodiment, associative processor arrays P_(M) 90 and P_(G) 92 perform parallel searches of unscheduled channel times and gaps between scheduled channel times and update state information. Gaps and unscheduled times are represented in relative times. P_(M) 90 and P_(G) 92 are coupled to control processor CP₁ 94. In one embodiment, a LAUC-VF (Latest Available Unused Channel with Void Filling) scheduling principle is used to determine a desired scheduling, as described in connection with U.S. Ser. No. 09/689,584, entitled “Hardware Implementation of Channel Scheduling Algorithms of Optical Routers with FDL Buffers” to Zheng et al, filed Oct. 12, 2000, and which is incorporated by reference herein.

[0079] The DCS module 84 uses two b-bit slot counters, C and C₁. Counter C keeps track of the time slots, which can be shared with the CCS module 86. Counter C₁ records the elapsed time slots since the last BHP is received. Both counters are incremented by every pulse of the slot clock. However, counter C₁ is reset to 0 when the DCS module 84 receives a new BHP. Once counter C₁ reaches 2^(b)-1, it stops counting, indicating that at least 2^(b)-1 slots have elapsed since the last BHP. The value of b should satisfy 2^(b)≧W_(s) where W_(s) is the data channel scheduling window. W_(s)=τ₀+Δ+Q_(B)+L_(max)−δ, where L_(max) is the maximum length of a DB and δ is the minimum delay of a BHP from O/E conversion to the scheduler 42. Assuming that τ₀=8, Δ=120, Q_(B)=32, L_(max)=64, and δ=40, then W_(s)=184 slots. In this case, b=8 bits.

[0080] Associative processor P_(M) in FIG. 5 is used to store the unscheduled time of each data channel in a DCG. Let t_(i) be the unscheduled time of channel H_(i) which is stored in ith entry of P_(M) 0≦i≦K−1. Then from slot t_(i) onwards, channel H_(i) is free, i.e., nothing being scheduled. t_(i) is a relative time, with respect to the time slot that the latest BHP is received by the scheduler. P_(M) has an associative memory of 2K words to store the unscheduled times t, and channel identifiers, respectively. The unscheduled times are stored in descending order. For example, in FIG. 6 we have K=8 and t₀≧t₁≧t₂≧t₃≧t₄∝t₅≧t₆≧t₇.

[0081] Similarly, associative processor P_(G) in FIG. 5 is used to store the gaps of data channels in a DCG. We use l_(i), and r_(j) to denote the start time and ending time of gap j, 0≦j≦G−1, which are also relative times. This gap is stored in jth entry of P_(G) and its corresponding data channel is H. P_(G) has an associative memory of G words to store the gap start time, gap ending time, and channel identifiers, respectively. Gaps are also stored in the descending order according to their start times l_(j). For example, FIG. 7 illustrates the associative memory of P_(G), where l₀≧l₁≧l₂≧ . . . l_(G−2)≧l_(G−1). G is the total number of gaps that can be stored. If there are more than G gaps, the newest gap with larger start time will push out the gap with the smallest start time, which resides at the bottom of the associative memory. Note that if l_(J)=0, then there are in total j gaps in the DCG, as l_(j+1)=l_(j+2)= . . . =l_(G−1)=0.

[0082] Upon receiving a request from the BHP processor to schedule a DB with departure time t_(DB) and duration l_(DB), the control processor (CP₁) 94 first records the time slot th during which it receives the request, reads counter C₁ (t_(e)←C₁) and reset C₁ to 0. Using t_(sch) as a new reference time, the CP₁ then calculates the DB departure time (no FDL buffer) with respect to t_(sch) as

t′ _(DB)=(t _(DB) −t _(sch)+2^(b))mod ₂ ^(b),  (2)

[0083] In the meantime, CP₁ updates P_(M) using

t _(i)=max(0, t _(i) −t _(e)), 0≦i≦K−1  (3)

[0084] and updates P_(G) using the following formulas,

l_(j)=max(0, l_(j) −t _(e)), 0≦j≦G−1  (4)

[0085] and

r _(j)=max(0, r _(j) −t _(e)), 0≦j≦G−1 .  (5)

[0086] After the memory update, CP₁ 94 arranges the search of eligible outgoing data channels to carry the data burst according to the LAUC-VF method, cited above. The flowchart is given in FIG. 8. In block 100, and index i is set to “0”. In block 102, P_(G) finds a gap in which to transmit the data burst t′_(DB)+Q_(i). In blocks 106, P_(M) finds an unscheduled channel in P_(M) to transmit the data burst at t′_(DB)+Q_(i). Note that the operations of finding a gap in P_(G) to transmit the DB at time t′_(DB)+Q_(i) and finding an unscheduled time in P_(M) to transmit the DB at time t′_(DB)+Q_(i) are preferably performed in parallel. The operation of finding a gap in P_(G) to transmit the data burst at time t′_(DB)+Q, (block 102) includes parallel comparison of each entry in P_(G) with (t′_(DB)+Q_(i), t′_(DB)+Q_(i)+l_(DB)) If t′_(DB)+Q_(i)>l_(j) and t′_(DB)+Q_(i)+l_(DB)≦r_(j), the response bit of entry j returns 1, otherwise it returns 0, 0≦j≦G−1. If at least one entry in P_(G) returns 1, the gap with the smallest index is selected.

[0087] The operation finding an unscheduled time in P_(M) to transmit the DB at time t′_(DB)+Q_(i) (block 106) includes parallel comparison of each entry in P_(M) with t′_(DB)+Q_(i). If t′_(DB)+Q_(i)≧t_(j), the response bit of entry j returns 1, otherwise it returns 0, 0≦j≦K−1. If at least one entry in P_(M) returns 1, the entry with the smallest index is selected.

[0088] If the scheduling is successful in decision blocks 104 or 108, then the CP₁ will inform the BHP processor 82 of the selected outgoing data channel and the FDL identifier in blocks 105 or 109, respectively. After receiving an ACK from the BHP processor 82, the CP₁ 94 will update P_(G) 90 or P_(M) 94 or both. If scheduling is not successful, i is incremented in block 110, and P_(M) and P_(G) try to a time to schedule the data burst at a different delay. Once Q_(i) reaches the maximum delay (decision block 112), the processors P_(M) and P_(G) report that the data burst cannot be scheduled in block 114.

[0089] To speed up the scheduling process, the search can be performed in parallel. For example, if B=2 and three identical P_(M)'S and P_(G)'S are used, as shown in FIG. 5, one parallel search will determine whether the data burst can be sent out at times t_(′DB), t_(′DB)+Q₁, and t′_(DB)+Q₂. The smallest time is chosen in case that the data burst can be sent out at different times. In another example, if B=5 and three identical P_(M)'S and P_(G)'S are used, at most two parallel searches will determine whether the DB can be scheduled.

[0090] Some simplified versions of the LAUC-VF methods are listed below which could also be used in the implementation. First, an FF-VF (first fit with void filling) method could be used wherein the order of unscheduled times in P_(M) and gaps in P_(G) are not sorted in a given order (either descending or ascending order), and the first eligible data channel found is used to carry the data burst. Second, a LAUC (latest available unscheduled channel) method could be used wherein P_(G) is not used, i.e., no void filling is considered. This will further simplify the design. Third, a FF (first fit) method could be used. FF is a simplified version of FF-VF where no void filling is used.

[0091] The block diagram of the CCS module 86 is shown in FIG. 9. Similar to the DCS module 84, associative processor P_(T) 120 keeps track of the usage of the control channel. Since a maximum of P_(f) bytes of payload can be transmitted per frame, memory T 121 of P_(T) 120 tracks only the number of bytes available per frame (FIG. 10). Relative time is used here as well. The CCS module 86 has two bi-bit frame counters, C^(f) and C₁ ^(f). C^(f) counts the time frames. C₁ ^(f) records the elapsed frames since the receiving of the last BHP. Upon receiving a BHP with arrival time t_(DB), CP₂ 122 timestamps the frame during which this BHP is received, i.e., t_(sch) ^(f)←C^(f). In the meantime, it reads counter C₁ ^(f) (t_(e) ^(f)←C₁ ^(f)) and reset C₁ ^(f) to 0. It then updates the P_(T) by shifting B_(i)'s down by t_(e) ^(f) positions, i.e.,

=B_(i), t_(e) ^(f)≦i≦2^(b) ^(¹⁻) 1, and B_(i)=P_(f) for 2^(b) ^(₁) −t_(e) ^(f)≦i≦2^(b) ^(₁) −1. In the initialization, all the entries in P_(T) are set to P_(f) Next, CP₂ calculates the frame t_(DB) ^(f) during which the data burst will depart (assuming FDL Q_(i) is used) using

t _(DB) ^(f)(Q _(i))=└((t _(DB) +Q _(i))mod2^(b))/T _(f)┘, 0≦i≦B.  (6)

[0092] where T_(f) is the frame length in slots. The relative time frame that the DB will depart is calculated from

t′ _(DB) ^(f)(Q _(i))=(t _(DB) ^(f)(Q _(i))−t _(sch) ^(f)−2^(b) ^(₁) )mod2^(b) ^(₁) , 0≦i≦2.  (7)

[0093] The parameter bi can be estimated from parameter b₁ e.g., 2^(b) ^(₁) =2^(b)/T_(f) . When b=8 and T_(f)=4, b₁=6. The following method is used to search for the possible BHP departure time for a given DB departure time t (e.g., t=t′_(DB) ^(f) (Q₁)). The basic idea is to send the BHP as earlier as possible, but the offset time should be no larger than τ₀ (as described in connection with FIGS. 4a and 4 b). Let J=└τ₀/T_(f)┘. For example, when τ₀=8 slots and T_(f)=1 slot, J=8. When τ₀=8 slots and T_(f)=4 slots, J=2. Suppose the BHP length is X bytes.

[0094] In the preferred embodiment, a constrained earliest time (CET) method is used for scheduling the control channel, as shown in FIG. 11. In step 130, P_(T) 120 performs a parallel comparison of X (i.e., the length of a BHP) with the contents B_(t−j) of relevant entries of memory T 121, E_(t−j), 0≦j≦J−1 and t−j>0. If X≦B_(t−j), entry E_(t−j), returns 1, otherwise it returns 0. In step 132, if at least one entry in P_(T) returns 1, the entry with the smallest index is chosen in step 134. The index is stored and the CCS module 86 reports that a frame to send the BHP has been found. If no entry in P_(T) returns a “1”, then a negative acknowledgement is sent to the BHP processor 82. (step 136)

[0095] The actual frame t_(f) that the BHP will be sent out is (t_(DB) ^(f)−j+2^(b) ^(₁) )mod2^(b) ^(₁) if E_(t−j) is chosen. The new burst offset-time is (t_(DB)mod T_(f))+j·T_(f).

[0096] After running the CET method, the CCS module 86 sends the BHP processor 82 the information on whether the BHP can be scheduled and in which time frame it will be sent. Once it gets an ACK from the BHP processor 82, the CCS module 86 will update PT. For example, if the content in entry y needs to be updated, then B_(y)←B_(y)−X. If the BHP cannot be scheduled, the CCS module 86 will send a NACK to the BHP processor 82. In the real implementation, the contents in P_(T) do not have to move physically. A pointer can be used to record the entry index associated with the reference time frame 0.

[0097] For parallel scheduling, as discussed below, since the CCS module 86 does not know the actual departure time of the data burst, it schedules the BHP for all possible departure times of the data burst or a subset and reports the results to the BHP processor 82. When B=2, there are three possible data burst departure times, t′_(DB), t′_(DB)+Q₁ and t′_(DB)+Q₂ . Like the DCS module 84, if three identical PT's are used, as shown in FIG. 9, one parallel search will determine whether the BHP can be scheduled for the three possible data burst departure times.

[0098] A block diagram of the path & channel selector 44 is shown in FIG. 12. The function of the path & channel selector 44 is to control the access to the R×R RB switch 26 and to instruct the RB switch controller 28 and the spatial switch controller 30 to configure the respective switches 26 and 24. The path & channel selector 44 includes processor 140 coupled to a recirculation-buffer-in scheduling (RBIS) module 142, a recirculation-buffer-out scheduling (RBOS) module 144 and a queue 146. The RBIS module 142 keeps track of the usage of the R incoming channels to the RB switch 26 while the RBOS module 144 keeps track of the usage of the R outgoing channels from the RB switch 26. Any scheduling method can be used in RBIS and RBOS modules 142 and 144, e.g., LAUC-VF, FF-VF, LAUC, FF, etc. Note that RBIS module 142 and RBOS module 144 may use the same or different scheduling methods. From manufacturing viewpoint, it is better that the RBIS and RBOS module use the same scheduling method as the DCS module 84. Without loss of generality, it is assumed here that the LAUC-VF method is used in both RBIS and RBOS modules 142 and 144; thus, the design of DCS module can be reused can be used for these modules.

[0099] Assuming a data burst with duration l_(DB) arrives to the OSM at time t_(DB) and requires a delay time of Q_(i). The processor 140 triggers the RBIS module 142 and RBOS module 144 simultaneously. It sends the information of t_(DB) and l_(DB) to the RBIS module 142, and the information of time-to-leave the OSM (t_(DB) +Q_(i)) and l_(DB) to the RBOS module 144. The RBIS module 142 searches for incoming channels to the RB switch 26 which are idle for the time period of (t_(DB) , t_(DB) +l_(DB)). If there are two or more eligible incoming channels, the RBIS module will choose one according to LAUC-VF. Similarly, the RBOS module 144 searches for outgoing channels from the RB switch 26 which are idle for the time period of (t_(DB) +Q_(i), t_(DB) +l_(DB)+Q,). If there are two or more eligible outgoing channels, the RBOS module 144 will choose one according to LAUC-VF. The RBIS (RBOS) module sends either the selected incoming (outgoing) channel identifier or NACK to the processor. If an eligible incoming channel to the RB switch 26 and an eligible outgoing channel from the RB switch 26 are found, the processor will send back ACK to both RBIS and RBOS module which will then update the channel state information. In the meantime, it will send ACK to the scheduler 42 and the configuration information to the two switch controllers 28 and 30. Otherwise, the processor 140 will send NACK to the RBIS and RBOS modules 142 and 144 and a NACK to the scheduler 42.

[0100] The RBOS module 144 is needed because the FDL buffer to be used by a data burst is chosen by the scheduler 42, not determined by the RB switch 26. It is therefore quite possible that a data burst can enter the RB switch 26 but cannot get out of the RB switch 26 due to outgoing channel contention. An example is shown in FIG. 13, where three fixed-length data bursts 148 a-c arrive to the 2×2 RB switch 26. The first two data bursts 148 a-b will be delayed 2D time while the third DB will be delayed D time. Obviously, these three data bursts will leave the switch at the same time and contend for the two outgoing channels. The third data burst 148 c is lost in this example.

[0101] The BHP transmission module 46 is responsible for transmitting the BHP on outgoing control channel 52 in the time frame determined by the BHP processor 82. Since the frame payload is fixed, equal P_(f), in slotted transmission, one possible implementation is illustrated in FIG. 14, where the whole memory is divided into W_(c) segments 150 and BHPs to be transmitted in the same time frame are stored in one segment 150. W_(c) is the control channel scheduling window, which equals to 2^(b) ^(₁) . There is a memory pointer per segment (shown in segment W₀, pointing to the memory address where a new BHP can be stored. To distinguish BHPs within a frame, the frame overhead should contain a field indicating the number of BHPs in the frame. Furthermore, each BHP should contain a length field indicating the packet length (e.g., in bytes), from the first byte to the last byte of the BHP.

[0102] Suppose t_(c) is the current time frame during which the BHP is received by the BHP transmission module and p_(c) points to the current memory segment. Given the BHP departure time frame t_(f), the memory segment to store this BHP is calculated from (p_(c)+(t_(f)−t_(c)+2^(b) ^(₁) ) mod 2^(b) ^(₁) ) mod 2^(b) ^(₁) .

[0103]FIG. 15 shows the optical router architecture using passive FDL loops 160 as the recirculation buffer, where the number of recirculation channels R=R₁+R₂+ . . . +R_(B), with jth channel group introducing Q_(j) delay time, 1≦j≦B. Here the recirculation channels are differentiated while in FIG. 1b all the recirculation channels are equivalent, able to provide B different delays. The potential problem of using the passive FDL loops is the higher block probability .of accessing the shared FDL buffer. For example, suppose B=2, R=4 and R₁=2, R₂=2, and currently two recirculation channels of R₁ are in use. If a new DB needs to be delayed by Q₁ time, it may be successfully scheduled in FIG. 1b, as there are still two idle recirculation channels. However, it cannot be scheduled in FIG. 15, since the two channels able to delay Q₁ are busy.

[0104] The design of the SCU 32 is almost the same as described previously, except for the following changes: (1) the RBOS module 144 within the path & channel selector 44 (see FIG. 12) is no longer needed, (2) slight modification is required in the RBIS module 142 to distinguish recirculation channels if B>1. To reduce the blocking probability of accessing the FDL buffer when B>1, the scheduler is required to provide more than one delay option for each databurst that needs to be buffered. The impact on the design of scheduler and path & channel selector 44 is addressed below. Without loss of generality, it is assumed in the following discussion that the scheduler 42 has to schedule the databurst and the BHP for B+1 possible delays.

[0105] The design of DCS module 84 shown in FIG. 5 remains valid in this implementation. The search results could be stored in the format shown in Table 1 (assuming B=2), where the indicator (1/0) indicates whether or not an eligible data channel is found for a given delay, say Q_(i). The memory type (0/1) indicates P_(M) or P_(G). The entry index gives the location in the memory, which will be used for information update later on. The channel identifier column gives the identifiers of the channels found. The DCS module then passes the indicator column and the channel identifier column (only those with indicator 1) to the BHP processor. TABLE 1 Stored search results in DCS module (B = 2). Indicator Memory type Entry Index Max Channel identifier (1 bit) (1 bit) (log₂ G, log₂ K) bits (log₂ K bits) Q₀ Q₁ Q₂

[0106] The design of CCS module 86 shown in FIG. 9 also remains valid. The search results could be stored in the format shown in Table 2 (assuming B=2), where the indicator (1/0) indicates whether or not the BHP can be scheduled on the control channel for a given DB departure time. The entry index gives the location in the memory, which will be used for information update later on. The “frame to send BHP” column gives the time frames in which the BHP are scheduled to send out. The CCS module then passes the indicator column and the “frame to send BHP” column (only those with indicator 1) to the BHP processor. TABLE 2 Stored search results in CCS module (B = 2). Indicator Entry Index Frame to send BHP (1 bit) (b₁ bits) (b₁ bits) Q₀ Q₁ Q₂

[0107] After comparing the indicator columns from the DCS and CCS modules, the BHP processor 82 in FIG. 3 knows whether the data burst and its BHP can be scheduled for a given FDL delay Q₁, 1≦i≦B and determines which configuration information will be sent to the path & channel selector 44 in FIG. 12. The three possible scenarios are, (1) the data burst can be scheduled without using FDL buffer, (2) the data burst can be scheduled via using FDL buffer, and (3) the data burst cannot be scheduled.

[0108] In the third case, the data burst and its BHP are simply discarded. In the first case, the following information will be sent to the path & channel selector: incoming DCG identifier, incoming data channel identifier, outgoing DCG identifier, outgoing data channel identifier, data burst arrival time to the spatial switch, data burst duration, FDL identifier 0 (i.e. Q₀). The path & channel selector 44 will immediately send back an ACK after receiving the information. In the second case, the following information will be sent to the path & channel selector:

[0109] incoming DCG identifier,

[0110] incoming data channel identifier,

[0111] number of candidate FDL buffer x,

[0112] for (i=1 to x do)

[0113] outgoing DCG identifier,

[0114] outgoing data channel identifier,

[0115] FDL identifier i,

[0116] data burst arrival time to the spatial switch,

[0117] data burst duration.

[0118] In the second scenario, the path & channel selector 44 will search for an idle buffer channel to carry the data burst. The RBIS module 142 is similar to the one described in connection with FIG. 12, except that now it has a P_(M) and PG pair for each group of channels with delay Q_(i), 1≦i≦B. An example is shown in FIG. 16 for B=2, as an example. With one parallel search, the RBIS module will know whether the data burst can be scheduled. When x=1, the RBIS module 142 performs parallel search on (P_(M1) 90 a, P_(G1) 92 a) or (P_(M2) 90 b, P_(G2) 92 b), depending on which FDL buffer is selected by the BHP processor 82. If an idle buffer channel is found, it will inform the processor 140, which in turn sends an ACK to the BHP processor 82. When x=2, both (P_(M1), P_(G1)) and (P_(M2), P_(G2)) will be searched. If two idle channels with different delays are found, the channel with delay Q₁ is chosen. In this case, an ACK together with the information that Q₁ is chosen will be sent to the BHP processor 82. After a successful search, the RBIS module 142 will update the corresponding P_(M) and P_(G) pair.

[0119] FIGS. 17-26 illustrate variations of the LAUC-VF method, cited above. In the LAUC-VF method cited above, two associative processors P_(M) and P_(G) are used to store the status of all channels of the same outbound link. Specifically, P_(M) stores r words, one for each of the r data channels of an outbound link. It is used to record the unscheduled times of these channels. PG contains n superwords, one for an available time interval (a gap) of some data channel. The times stored in P_(M) and P_(G) are relative times. P_(M) and P_(G) support associative search operations, and data movement operations for maintaining the times in a sorted order. Due to parallel processing, P_(M) and P_(G) are used as major components to meet stringent real-time channel scheduling requirement.

[0120] In the embodiment described in FIGS. 22-23, a pair of associative processors P_(M) and P_(G) for the same outbound link are combined into one associative processor P_(MG). The advantage of using a unified P_(MG) to replace a pair of P_(M) and P_(G) is the simplification of the overall core router implementation. In terms of ASIC implementation, the development cost of a P_(MG) can be much lower than that of a pair of P_(M) and P_(G). P_(MG) can be used to implement a simpler variation of the LAUC-VF method with faster performance.

[0121] In FIGS. 17a and 17 b, two outbound channels Ch₁ and Ch₂ are shown, with to being the current time. With respect to t₀, channel Ch₁ has two DBs, DB₁ and DB₂, scheduled and channel Ch₂ has DB₃ scheduled. The time between DB₁ and DB₂ on Ch₁, which is a maximal time interval that is not occupied by any DB, is called a gap. The times labeled t_(i) and t₂ are the unscheduled time for Ch₁ and Ch₂, respectively. After t₁ and t₂, Ch₁ and Ch₂ are available for transmitting any DB, respectively.

[0122] The LAUC-VF method tries to schedule DBs according to certain priorities. For example, suppose that a new data burst DB₄ arrives at time t′. For the situation of FIG. 17a, DB₄ can be scheduled within the gap on Ch₁, or on Ch₂ after the unscheduled time of Ch₂. The LAUC-VF method selects Ch₁ for DB₄, and two gaps are generated from one original gap. For the situation of FIG. 17b, DB₄ conflicts with DB₁ on Ch₁ and conflicts with DB₃ on Ch₂. But by using FDL buffers, it may be scheduled for transmission without conflicting DBs on Ch₁ and/or DBs on Ch₂. FIG. 17b shows the scheduling that DB₄ is assigned to C₂, and a new gap is generated.

[0123] Assuming that an outbound link has r data channels, the status of this link can be characterized by two sets:

[0124] S_(M){(t_(i), i) | t_(i), is the unscheduled time for channel Ch_(i)}

[0125] S_(G){(l_(j), r_(j), c_(j)) | l_(j)<r_(j) and the interval [l_(j), r_(j)] is a gap on channel Ch_(cj)}

[0126] In the embodiment of LAUC-VF proposed in U.S. Ser. No. 09/689,584, the two associative processors P_(M) and P_(G) were proposed to represent S_(M) and S_(G), respectively. Due to fixed memory word length, the times stored in the associative memory M of P_(M) and the associative memory G of P_(G) are relative times. Suppose the current time is to. Then any time value less than to is of no use for scheduling a new DB. Let

S′ _(M)={(max{t _(i) −t ₀, 0}, i)|(t _(i) , i)∈S_(M)}

S′ _(G)={(max{l _(j) −t ₀, 0}, max{l _(j) −t ₀, 0}, c _(j))|(l _(j) , r _(j) , c _(j))∈S _(G)}

[0127] The times in S′_(M) and S′_(G) are times relative to the current time to, which is used as a reference point 0. Thus, M of P_(M) and G of P_(G) are actually used to store S′_(M) and S′_(G) respectively.

[0128] The channel scheduler proposed in U.S. Ser. No. 09/689,584 assumes that DBs have arbitrary lengths. One possibility is to assume a slot transmission mode. In this mode, DBs are transmitted in units of slots, and BHPs are transmitted as groups, and each group is carried by a slot. A slot clock CLK_(s) is used to determine the slot boundary. The slot transmissions are triggered by pulses of CLK_(s). Thus, the relative time is represented in terms of number of CLK_(s) cycles. The pulses of CLK_(s) are shown in FIG. 18. In addition to clock CLK_(s), there is another finer clock CLK_(f). The period of CLK_(s) is a multiple of the period CLK_(f). In X the example shown in FIG. 18, one CLK_(s) cycle contains sixteen CLK_(f) cycles. Clock CLK_(f) is used to coordinate operations performed within a period of CLK_(s).

[0129] In FIGS. 19a and 19 b, modifications to the hardware design of P_(M) and P_(G) given in U.S. Ser. No. 09/689,584 are provided for accommodation of slot transmissions. In P_(M), there is an associative memory M of r words. Each word M_(i) of M is essentially a register, and it is associated with a subtractor 200. A register MC holds an operand. In the embodiment of FIG. 19a, the value stored in MC is the elapsed time since the last update of M. The value stored in MC is broadcast to all words M_(i), 1≦i≦r. Each word M_(i) does the following: M_(j)←M_(j)−MC if M_(j)>MC; otherwise, M_(i)←0. This operation is used to update the relative times stored in M. If MC stores the elapsed time since last time parallel subtraction operation is performed, performing this operation again updates these times to the time relative to the time when this new PARALLEL-SUBTRACTION is performed. Another operation is the parallel comparison. In this operation, the value stored in MC is broadcast to all words M_(i), 1≦i≦r. Each word M_(i) does the following: if MC>M_(i) then MFLAG_(i)=1, otherwise MFLAG_(i)=0. Signals MFLAG_(i), 1≦i≦r, are transformed into an address by a priority encoder. This address and the word with this address are output to the address and data registers, respectively, of M. This operation is used to find a channel for the transmission of a given DB. Similarly, two subtractors are used for a word, one for each sub-word, of the associative memory G in P_(G).

[0130] An alternative design, shown in FIG. 19b, is to implement each word M_(i) in M as a decrement counter with parallel load. The counter is decremented by 1 by every pulse of the system slot clock CLK_(s). The counting stops when the counter reaches 0, and the counting resumes once the counter is set to a new positive value. Suppose that at time to the counter's value is t′ and at time t_(i)>t₀ the counters value is t″. Then t″ is the same time of t′, but relative to t_(i), i.e. t″=max{t′−(t₁−t₀), 0}. Note that any negative time (i.e. t′−(t_(i)−t₀)<0) with the new reference point t_(i) is not useful in the lookahead channel scheduling. Associated with each word M_(i) is a comparator 204. It is used for the parallel comparison operation. Similarly, a word of G in P_(G) can be implemented by two decrement counters with two associated comparators.

[0131] The system has a c-bit circular increment counter C_(s). The value of C_(s) is incremented by 1 by every pulse of slot clock CLK_(s). Let t_(latancy) (BPHP_(i)) be the time, in terms of number of S′_(G) cycles, between the time BHP_(i) is received by the router and the time BHP_(i) is received by the channel scheduler. The value c is chosen such that: $2^{c} > \left\lceil \frac{\max \quad {t_{latency}\left( {BHP}_{i} \right)}}{{MAX}_{s}} \right\rceil$

[0132] where MAX_(s) is the number of CLK_(f) cycles within a CLK_(s) cycle. When BHP_(i) is received by the router, BHP_(i) is timestamped by operations timestamp_(recv)(BHP_(i))←C_(s). When BHP_(i) is received by the scheduler of the router, BHP_(i) is timestamped again by timestamp_(sch)(BHP_(i))←C_(S). Let

D _(i)=(timestamp_(recv)(BHP _(i))+2^(c)−timestamp_(sch)(BHP _(i)))mod2^(c).

[0133] Then, the relative arrival time (in terms of slot clock CLK_(s)) of DB_(i) at the optical switching matrix of the router is T_(i)=Δ+τ_(i)+D_(i), where τ_(i) is the offset time between BHP_(i) and DB_(i), and D is the fixed input FDL time. Using the slot time at which timestamp_(sch)(BHP_(i))←C_(s) is performed as reference point, and the relative times stored in P_(M) and P_(G), DB_(i) can be correctly scheduled.

[0134] In the hardware implementation of LAUC-VF method, associative processors P_(M) and P_(G) are used to store and process S′_(M) and S′_(G), respectively. At any time, S′_(M) ={(t_(i), i)|1≦i≦r} and S′_(G) ={(l_(j), r_(j), c_(j))|l_(j)≧0}. A pair (t_(i), i) in S′_(M) represents the unscheduled time on channel Ch_(i), and a triple (l_(j), r_(j), c_(j)) in S′_(G) represents a time gap (interval) [l_(j), r_(j)] on channel Ch_(cj). The unscheduled time t_(i) can be considered as a semi-infinite gap (interval) [t_(i), ∞]. Thus, by including such semi-infinite gaps into S′_(G), S′_(M) is no longer needed.

[0135] More specifically, let S″_(M)={(t_(i), ∞, i)|(t_(i), i)∈S′_(M)}, and define S′_(MG)=S″_(M)∪S′_(G). The basic idea of combining P_(M) and P_(G) is to build P_(MG) by modifying P_(G) SO that P_(MG) is used to process S′_(MG). We present the architecture of associative processor P_(MG) for replacing P_(M) and P_(G). P_(MG) uses an associative memory MG to store pairs in S′_(M) and triples in S′_(G). As G in P_(G), each word of MG has two sub-words, with the first one for l_(j) and second one for r_(j) when it is used to store (1j, r_(j), c_(j)). When a word of MG is used to store a pair (t_(i), i) of S′_(M), the first sub-word is used fort_(i), and the second is left unused. The first r words are reserved for S′_(M), and the remaining words are reserved for S′_(G). The first r words are maintained in non-increasing order of their first sub-word. The remaining words are also maintained in non-increasing order of their first subword. New operations for P_(MG) are defined.

[0136] Below, the structures and operations of P_(M) and P_(G) are summarized, and the structure and operations of P_(MG) are defined. The differences between P_(MG) include the number of address registers used, the priority encoders, and operations supported. It is shown that P_(MG) can be used to implement the LAUC-VF method without any slow-down, in comparison with the implementation using P_(M) and P_(G).

[0137] The outbound data channel of a core router has r channels (wavelengths) for data transmission. These channels are denoted by Ch₁, Ch₂, Ch_(r). Let S={t_(i)|1≦i≦r}, where t_(i) is the unscheduled time for channel Ch_(i). In other words, at any time after t_(i), channel Ch₁ is available for transmission. Given a time T′, P_(M) is an associative processor for fast search of T″=min{t_(i)|t_(i)|t_(i)≧T′}, where T′ is a given time. Suppose that T″=t_(j), then channel Ch_(j) is considered as a candidate data channel for transmitting a DB at time T′.

[0138] For purposes of illustration, the structures of P_(M) and P_(G) are shown in FIGS. 20 and 21 and P_(MG) is shown in FIG. 22.

[0139] An embodiment of P_(M) 210 is shown in FIG. 20. Associative processor P_(M) includes an associative memory M 212 of k words, M₁, M₂, . . . , M_(k), one for each channel of the data channel group. Each word is associated with a simple subtraction circuit for subtraction and compare operations. The words are also connected as a linear array. Comparand register MC 214 holds the operand for comparison. MCH 216 is a memory of k words, MCH₁, MCH₂, . . . , MCH_(k), with MCH_(j) corresponding to M_(j). The words are connected as a linear array, and they are used to hold the channel numbers. MAR₁ 218 and MAR₂ 220 are address registers for holding addresses for accessing M and MCH. MDR 222 and MCHR 224 are data registers used to access M and MCHR along with the MARs.

[0140] Associative processor P_(M) supports the following major operations that are used in the efficient implementation of the LAUC-VF channel scheduling operations:

[0141] RANDOM-READ: Given address x in MAR₁, do MDR₁←M_(x), MCHR←MCH_(X).

[0142] RANDOM-WRITE: Given address x in MAR₁, do M_(x)←MDR, MCH_(x) ←MCHR.

[0143] PARALLEL-SEARCH: The value of MC is compared with the values of all word M₁, M₂, . . . , M_(k) simultaneously (in parallel). Find the smallest j such that M_(j)<MC, and do MAR₁←j, MDR₁←M_(j), and MCHR←MCH. If there does not exist any word M_(j) such that M_(j)<MC, MAR₁=0 after this operation.

[0144] SEGMENT-SHIFT-DOWN: Given addresses a in MAR₁, and b in MAR₂ such that a<b, perform M_(j+1)←M_(j) and MCH_(j)←MCH_(j) for all a≦j<b.

[0145] For RANDOM-READ, RANDOM-WRITE and SEGMENT-SHIFT-DOWN operations, each pair (M_(j), MCH_(j)) is treated as a superword. The output of PARALLEL-SEARCH consists r binary signals, MFLAG₁, 1≦i≦r. MFLAG_(i)1 if and only if M_(i)≦MC. There is a priority encoder with MFLAG_(i), 1≦i≦r, as input, and it produces an address j and this value is loaded into MAR₁ when PARALLEL-SEARCH operation is completed. RANDOM-READ, RANDOM-WRITE, PARALLEL-SEARCH and SEGMENT-SHIFT-DOWN operations are used to maintain the non-increasing order of values stored in M.

[0146]FIG. 21 illustrates a block diagram of the associative processor P_(G) 92. A P_(G) is used to store unused gaps of all channels of an outbound link of a core router. A gap is represented by a pair (l, r) of integers, where l and r are the beginning and the end of the gap, respectively. Associative processor P_(G) includes associative memory G 93, comparand register GC 230, memory GCH 232, address register GAR 234, data registers GDR 236 and GCHR 238 and working registers GR₁ 240 and GR₂ 242.

[0147] G is an associative memory of n words, G₁, G₂, . . . , G_(n), with each G_(i) consisting of two sub-words G_(i,1) and G_(i,2). The words are connected as a linear array. GC holds a word of two sub-words, GC₁ and GC₂. GCH is a memory of n words, GCH₁, GCH₂, . . . , GCH_(n) with GCH_(j) corresponding to G_(j). The words are connected as a linear array, and they are used to hold the channel numbers. GAR is an address register used to hold address for accessing G. GDR, and GCHR are data registers used to access M and MCHR, together with GAR.

[0148] Associative processor P_(G) supports the following major operations that are used in the efficient implementation of the LAUC-VF channel scheduling operations:

[0149] RANDOM-WRITE: Given address x in GAR, do G_(x,1)←GDR₁←G_(x,2)←GDR₂, GCH_(x)←GCHR.

[0150] PARALLEL-DOUBLE-COMPARAND-SEARCH: The value of GC is compared with the values of all word G1,G2, . . . , Gn simultaneously (in parallel). Find the smallest j such that G_(j,1)<GC₁ and G_(j,2)>GC₂. If this operation is successful, then do GDR₁←G_(j,1), GDR₂←G_(j,2), GCHR←GCH_(j), and GAR←j; otherwise, GAR←0.

[0151] PARALLEL-SINGLE-COMPARAND-SEARCH: The value of GC₁ is compared with the values of all word G1,G2, . . . ,G_(n) simultaneously (in parallel). Find the smallest j such that G_(j,1)>GC₁ and j in a register GAR. If this operation is successful, then do GDR₁←G_(j,1), GDR₂←G_(j,2), GCHR←GCH_(j), and GAR←j; otherwise, GAR←0.

[0152] BIPARTITION-SHIFT-UP: Given address a in GAR, shift the content of G_(j+1) to G_(j)←G_(j+1), GCH_(j)←GCH_(j+1), GCH_(j) to GCH_(j+1) for a ≦j≦n, and G_(n,1)←0, G_(n,2)←0.

[0153] BIPARTITION-SHIFT-DOWN: Given address a in GAR, do G_(j+1)←G_(j), GCH_(j+1)←GCH_(j), a≦j<n.

[0154] In P_(G), a triple (G_(i,1), G_(i,2,)GCH_(i)) corresponds to a gap with beginning time G_(i,1) and ending time G_(i,2) on channel GCH₁. For RANDOM-WRITE, PARALLEL-DOUBLE-COMPARAND-SEARCH, PARALLEL-SINGLE-COMPARAND-SEARCH, BIPARTITION-SHIFT-UP, and BIPARTITION-SHIFT-DOWN operations, each triple (G_(i,1), G_(i,2), GCH_(i)) is treated as a superword. The output of PARALLEL-DOUBLE-COMPARAND-SEARCH (resp. PARALLEL-SINGLE-COMPARAND-SEARCH) operation consists n binary signals, GFLAG_(i), 1≦i≦n, such that GFLAG_(i)=1 if and only if G_(i,l)≧GC₁ and G_(i,2)≦GC₂ (resp. G_(i,1)≧GC₁). There is a priority encoder with GFLAG_(i)=1≦i≦n, as input, and it produces an address j and this value is loaded into GAR₁ when the operation is completed. RANDOM-WRITE, PARALLEL-SINGLE-COMPARAND-SEARCH, BIPARTITION-SHIFT-UP, and BIPARTITION-SHIFT-DOWN operations maintain the non-increasing order of values stored in G_(i,1)s.

[0155] The operations of P_(M) and P_(G) are discussed in greater detail in U.S. Ser. No. 09/689,584.

[0156]FIG. 22 illustrates a block diagram of a processor P_(MG), which combines the functions of the P_(M) and P_(G) processors described above. P_(MG) includes associative memory MG 248, comparand register MGC 250, memory MGCH 252, address registers MGAR₁ 254 a and MGAR₂ 254 b, and data registers MGDR 256 and MGCHR 258.

[0157] MG is an associative memory of m=r+n words, MG_(1, MG) ₂, . . . , MG_(m), with each MG_(i) consisting of two sub-words MG_(i,1) and MG_(i,2). The words are also connected as a linear array. MGC is a comparand register that holds a word of two sub-words, MGC₁ and MGC₂. MGC also holds a word of two sub-words, MGC₁ and MGC₂. MGCH is a memory of m words, MGCH₁, MGCH₂,. . . , MGCH_(m) with MGCH_(j) corresponding to MG_(j). The words are connected as a linear array, and they are used to hold the channel numbers.

[0158] Associative processor P_(MG) supports the following major operations:

[0159] RANDOM-READ: Given address x in MGAR₁, do MGDR₁ F MG_(i,1), MGDR₂←MG_(i,2), MCHR←MGCH_(x).

[0160] RANDOM-WRITE: Given address x in MGAR, do MGX_(x,1)←MGDR₁, MG_(x,2)←MGDR₂, MGCH_(x)←MGCHR.

[0161] PARALLEL-COMPOUND-SEARCH: In parallel, the value of MGC₁ is compared with the values of all superwords MG_(i), 1≦i≦m, and the values of MGC₁ and MGC₂ are compared with all super words MG_(j), r +1≦j≦m, in parallel. (i) If MGC₂≠0, then do the following in parallel: Find the smallest j′ such that j′≦r and MG_(j′), 1<MGC₁. If this search is successful, then do MGAR₁←j′, otherwise, MGAR₁←0. Find the smallest j″ such that r+1≦j″≦m, MG_(j,1)<MGC₁ and MG_(j,2)>MGC₂. If this search is successful, then do MGAR₂←j′ and MGCHR←MGCH_(i); otherwise MGAR₂←0. (ii) If MGC₂=0, then find the smallest j′ such that 1≦j′<m and MG_(j),1<MGC₁. MG_(j,2)>MGC₂. If this search is successful, then do MGAR₁←j″ and MGCHR←MGCH_(i); otherwise MGAR₁←0.

[0162] BIPARTITION-SHIFT-UP: Given address a in MGAR₁, do MG_(j)←MG_(j+1), MGCH_(j)←MGCH_(j+1), MGCH, to MGCH_(j+1) for a≦j≦m, and MG_(n,1)←0, MG_(n,2)←0.

[0163] SEGMENT-SHIFT-DOWN: Given addresses a in MGAR₁, and b in MGAR₂ such that a<b, perform MG_(j+1)←MG_(j) and MGCH_(j+1)←MGCH_(j) for all a≦j<b.

[0164] As in P_(G), a triple (MG_(i,1), MG_(i,2), MGCH_(i)) may correspond to a gap with beginning time MG_(i,1) and ending time MG_(i,2) on channel MGCH_(i). But in such a case, it must be that i>r. If i≦r, then MG_(i,2) is immaterial, the pair (MG_(i,1), MGCH_(i)) is interpreted as the unscheduled time MG_(i,1) on channel MGCH_(i), and this pair corresponds to a word in P_(M). For RANDOM-READ, RANDOM-WRITE, PARALLEL-COMPOUND-SEARCH, BIPARTITION-SHIFT-UP and SEGMENT-SHIFT-DOWN operations, each triple (MG_(i,1), MG_(i2), . . . , MGCH_(i)) is treated as a superword. The first r superwords are used for storing the unscheduled times of r outbound channels, and the last m−r superwords are used to store information about gaps on all outbound channels.

[0165] The output of PARALLEL-COMPOUND-SEARCH operation consists of binary signals MGFLAG_(i) whose values are defined as follows: (i) if MGC₂=0 and MG_(i,1)≧MGC₁ then MGFLAG_(i)=1; (ii) if MGC₂×0, i≦r, MG_(i,1)≧MGC₁ then MGFLAG_(i)=1; (iii) if MGC₂×0, i>r, MG_(i,1) ≧MGC₁ and MG_(i,2)≦MGC₂ then MGFLAG₁=1, or if MG_(i,1)>MGC₁ and i≦r then MGFLAG_(i)=1; and (iv) otherwise, MGFLAG₁=0.

[0166] There are two encoders. The first one uses MGFLAG₁, 1≦i≦r, as its input, and it produces an address in MGAR₁ after a PARALLEL-COMPOUND-SEARCH operation is performed if MGC₂≠0. The second encoder uses MGFLAG_(i), r+1≦i≦m, as its input. It produces an address in MGAR₂ after a PARALLEL-COMPOUND-SEARCH operation is performed if MGC₂≠0. There is a selector with the output of the two encoders as its input. If MGC₂=0, the smallest non-zero address produced by the two encoders, if such an address exists, Ys loaded into MGAR₁ after a PARALLEL-COMPOUND-SEARCH operation is performed; otherwise, MGAR₁ is set to 0 after a PARALLEL-COMPOUND-SEARCH operation is performed; If MGC₂≠0 the output of the selector is disabled.

[0167] RANDOM-READ, RANDOM-WRITE, PARALLEL-COMPOUND-SEARCH1, BIPARTITION-SHIFT-UP and SEGMENT-SHIFT-DOWN operations are used to maintain the non-increasing order of values stored in MG_(i,1) of the first m words, and the non-increasing order of the values stored in MG_(i,1) of the last m−r words.

[0168] The operations of associative processors P_(M) and P_(G) can be carried out by operations of P_(MG) without any delay when they are used to implement LAUC-VF channel scheduling method. We assume that P_(MG) contains m=r+n superwords. In Table 3 (resp. Table 4), the operations of P_(M) (resp. PG) given in the left column are carried out by operations of P_(MG) given the right column. Instead of searching P_(M) and P_(G) concurrently, using P_(MG), this step can be carried out by PARALLEL-COMPOUND-SEARCH operation. TABLE 3 Simulation of P_(M) by P_(MG) P_(M) P_(MG) RANDOM-READ RANDOM-READ RANDOM-WRITE RANDOM-WRITE PARALLEL-SEARCH PARALLEL-COMPOUND- SEARCH SEGMENT-SHIFT- SEGMENT-SHIFT-DOWN DOWN (with_MGAR₂ = m)

[0169] TABLE 4 Simulation of P_(M) by P_(MG) P_(G) P_(MG) RANDOM-WRITE RANDOM-WRITE PARALLEL-DOUBLE-COMPARAND- PARALLEL-COMPOUND- SEARCH SEARCH PARALLEL-SINGLE-COMPARAND- PARALLEL-COMPOUND- SEARCH SEARCH (with MGC₂ = 0) BIPARTITE-SHIFT-UP SEGMENT-SHIFT-UP (with MGAR₂ = m) BIPARTITE-SHIFT-DOWN SEGMENT-SHIFT-DOWN (with MGAR₂ = m − 1)

[0170] In the LAUC-VF method, fitting a given DB into a gap is preferred, even the DB can be scheduled on another channel after its unscheduled time, as shown by the example of FIGS. 17a-b. With separate P_(M) and P_(G) and performing search operations on P_(M) and P_(G) simultaneously, this priority is justifiable. However, the overall circuit for doing so may be considered too complex.

[0171] By combining P_(M) and P_(G) into one associative processor, simpler and faster variations of this LAUC-VF methods are possible. An alternative embodiment is shown in FIG. 23. In this figure, processor P*_(MG) 270 includes an array TYPE 272 with m bits, each bit being associated with a corresponding word in memory MG. If TYPE_(i)=1 then MG_(i) stores an item of S′_(M) otherwise, MG, stores an item of S′_(G). Further, register TYPER 274 is a one-bit register used to access TYPE, together with MGAR₁ and MGAR_(2.)

[0172] Other differences between P*_(MG) and P_(MG) include the priority encoder used and the operations supported. When a new DB is scheduled, MG is searched. The fitting time interval found, regardless if it is a gap or a semi-infinite interval, will be used for the new DB. Once the DB is scheduled, one more gap may be generated. As long as there is sufficient space in MG, the new gap is stored in MG. When MG is full, an item of S′_(G) may be lost. But it is enforced that all items of S′_(M) must be kept.

[0173] Let t_(s) ^(out)(DB_(i)) and t_(e) ^(out)(DB_(i)) be the transmitting time of the first and last slot of DB_(i) at the output of the router, respectively. Then

t _(s) ^(out)(DB _(i))=T _(i) + _(L) _(j)

[0174] and

t _(e) ^(out)(DB _(i))=T _(i) +L _(j)+length(DB _(i)),

[0175] where T_(i) is the relative arrival time defined above, L_(j) is the FDL delay time selected for DB_(i) in the switching matrix and length(DB2) is the length of DB_(i) in terms of number of slots. Assume that there are q+1 FDLs L₀, L₁, . . . , L_(q) in the DB switching matrix such that L₀=0<L₁<L₂< . . . <L_(q−i)<L_(q). The new variation of LAUC-VF is sketched as follows: method CHANNEL-SCHEDULING begin success ← 0; for j = 0 to q do MGC₁ ←T_(l)+L_(j) MGC₂ ← T_(l) + L_(j) + length(DB_(i)); perform PARALLEL-COMPOUND-SEARCH using P*_(MG); if MGAR₁ ≠ 0 then if MGAR₁ ≠ 0 then begin output MGCHR as the number of the channel for transmitting DB_(i) output L_(j) as the selected FDL delay time for DB_(i); update MG of P* _(MG) using the values in MGC₁ and MGC₂ success ← 1; exit /* exit the for-Loop */ end endfor if success = 0 then drop DB_(i)/* scheduling for DB_(i) is failed */ end

[0176] Once a DB is scheduled, MG is updated. When a gap is to be added into MG, and TYPE_(m)=1, the new gap is ignored. This ensures that no item belonging to S′_(M) is lost.

[0177] Associative processor P*_(MG) supports the following major operations:

[0178] RANDOM-READ: Given address x in MGAR₁, do MGDR₁←MG_(i,1), MGDR₂←MG_(i,2), MCHR←MGCH_(x), TYPER←TYPE_(x).

[0179] RANDOM-WRITE: Given address x in MGAR₁, do MG_(x,1)←MGDR₁, MG_(x,2)←MGDR₂, MGCH_(x)←MGCHR, TYPE_(x)←TYPER.

[0180] PARALLEL-COMPOUND-SEARCH: The value of MGC₁ is compared with the values of all superwords MG_(i), 1≦i≦m, and MGC₂ are compared with all super words MG_(i), 1≦i≦m, whose TYPEi=0, in parallel. Find the smallest j′ such that TYPE_(j′)=1 and MG_(j′,1)<MGC₁, or TYPE_(j)=0, MG_(j,1)<MGC₁ and MG_(j,2)>MGC₂. If this search is successful, then do MGAR₁←j′, TYPER←TYPE_(j′), MGCH←MGCH_(j′); otherwise, otherwise MGAR₁←0.

[0181] BIPARTITION-SHIFT-UP, SEGMENT-SHIFT-DOWN: same as in P_(MG).

[0182] In operation, The value of TYPE_(i) indicates the type of information stored in MG_(i). As in P_(G), a triple (MG_(i,1), MG_(i,2), MGCH_(i)) may correspond to a gap with beginning time MG_(i,1) and ending time MG_(i,2) on channel MGCH₁. But in such a case, it must be that TYPE₁=0. If TYPE₁=1, then MG_(i,2) is immaterial, the pair (MG_(i,1), MGCH_(i)) is interpreted as the unscheduled time MG_(i,1) on channel MGCH_(i) and this pair corresponds to a word in P_(M). For RANDOM-READ, RANDOM-WRITE, PARALLEL-COMPOUND-SEARCH, BIPARTITION-SHIFT-UP and SEGMENT-SHIFT-DOWN operations, each quadruple (MG_(i,1), MG_(i,2), TYPE_(i), MGCH_(i)) is treated as a superword.

[0183] The output of PARALLEL-COMPOUND-SEARCH operation consists of binary signals MGFLAG_(i) whose values are defined as follows. If MGC₂≈0, TYPE=0, G_(i,1)≧GC₁ and G_(i,2) >GC₂ then MGFLAG_(i)=1. If MGC₂=0 and G_(i,1)≧GC, then MGFLAG_(i)=1. Otherwise, MGFLAG_(i)=0. There is a priority encoders. If MGFLAG_(i), 1≦i≦m, as its input, and it produces an address in MGAR₁ after a PARALLEL-COMPOUND-SEARCH operation is performed.

[0184] RANDOM-READ, RANDOM-W7RITE, PARALLEL-COMPOUND-SEARCH, BIPARTITION-SHIFT-UP and SEGMENT-SHIFT-DOWN operations are used to maintain the non-increasing order of values stored in MG_(i,1)s.

[0185]FIG. 24 illustrates the use of multiple associative processors for fast scheduling. Channel scheduling for an OBS core router is very time critical, and multiple associative processors (shown in FIG. 24 as P*_(MG) processors 270), which are parallel processors, are proposed to implement scheduling methods. Suppose that there are q+1 FDLs L₀=0, L₁, . . . , L_(q) in the DB switching matrix such that L₀ <L₁ < . . . <L_(q). These FDLs are used, when necessary, to delay DBs and increase the possibility that the DBs can be successfully scheduled. In the implementation of the LAUC-VF method presented in U.S. Ser. No. 09/689,584, the same pair of P_(M) and P_(G) are searched repeatedly using different FDLs until a scheduling solution is found or all FDLs are exhausted. The method CHANNEL-SCHEDULING described above uses the same idea.

[0186] To speed up the scheduling, a scheduler 42 may use q+1 P_(M)/P_(G) pairs, one for each L_(i). At any time, all q+1 Ms have the same content, all q+1 MCHs have the same content, all q+1 Gs have the same content, and all q+1 GCHs have the same content. Then finding a scheduling solution for all different FDLs can be performed on these P_(M)/P_(G) pairs simultaneously. At most one search result is used for a DB. All P_(M)/P_(G) pairs are updated simultaneously by the same lock-step operations to ensure that they store the same information. Similarly, one may use q+1 P_(MG) _(^(S)) or P*_(MG) _(^(S)) to speed up the scheduling.

[0187] In FIG. 24, a multiple processor system 300 uses q+1 P*_(MG) _(^(S)) 270 implement the method CHANNEL-SCHEDULING described above. Similarly, the LAUC-VF method can be implemented using multiple P_(M)/P_(G) pairs, or multiple P_(MG) _(^(S)) in a similar way to achieve better performance. The multiple P*_(MG) _(^(S)) 270 include q+1 associative memories MG⁰, MG¹, . . . , MG^(q). Each MG^(i) has m words MG^(j) ₁, MG^(j) ₂, . . . , MG^(j) _(m), with each MG^(j) ₁ consisting of two sub-words MG^(j) _(i,1) and MG^(j) _(i,2). There are q+1 comparand registers MGC⁰, MGC¹, . . . , MGC^(q). Each MGC^(j) holds a word of two sub-words, MGC^(j) ₁, and MGC^(j) ₂. MGCHs: There are q+1 associative memories MGCH⁰, MGCH¹ . . . , MGCH^(q). Each MGCH^(j) has m words, MGCH^(j) ₁, MGCH^(j) ₂, . . . , MGCH^(j) _(m). The words in MGCH^(j) are connected as a linear array. There are q+1 linear arrays TYPE⁰, TYPE¹, . . . , TYPE^(q), where TYPE^(j) has m bits, TYPD^(j) ₁, TYPE^(j) ₂ . . . , TYPE^(j) _(m). MGAR₁, MGAR are address registers used to hold address for accessing MG and MGCH. MGDR, TYPER, MGCH are: data registers used to access MGs, TYPEs and MGCHR.

[0188] This multiple processor system 300 supports the following major operations:

[0189] RANDOM-READ: Given address x in MGAR₁, do MGDR₁←MG⁰ _(t,1), MGDR₂←MG⁰ _(i,2), MCHR←MGCH⁰ _(x), TYPER←TYPE⁰ _(x).

[0190] RANDOM-WRITE: Given address x in MGAR₁, do MG^(j) _(x,1)←MGDR₁, MG^(j) _(x,2)←MGDR₂, MGCH^(j) _(x)←MGCHR, TYPE^(j) _(x)←TYPER, for 0≦j≦q.

[0191] PARALLEL-COMPOUND-SEARCH: For 0≦j≦q, the value of MGC^(j) ₁ is compared with the values of all superwords MG^(j) ₁, 1≦i≦m, and MG^(j) ₂ are compared with all super words MG^(j) _(i), 1≦i≦m, whose TYPE^(j) _(i)=0. in parallel. For 0≦j≦q, find the smallest k_(j), such that TYPE^(j) _(kj,1)=1 and MG^(j) _(kj,1)<MGC^(j) ₁, or TYPE^(j) _(kj.)=0, MG^(j) _(kj,1)<MGC^(j) ₁ and MG^(j) _(kj,2)>MGC^(j) ₂. If this search is successful, let l_(j)=1; otherwise let l_(j)=0. Find FD=min{j|l_(j)=1,0≦j≦q}. If such l exists, then do j←FD, MGAR₁←k_(j), TYPER←TYPD^(j) _(kj.), MGCH←MGCH^(j) _(kj.); otherwise, otherwise MGAR₁←0.

[0192] BIPARTITION-SHIFT-UP: Given address a in MGAR₁, for 0≦j≦q, do MG^(j) _(i)←MG^(j) _(i+1) MGCH^(j) _(i)←MGCH^(j) _(i+1), MGCH^(j) _(i) to MGCH^(j) _(i+1), for a≦i<m, and MG^(j) _(n,1)←0, MG^(j) _(n,2)←0.

[0193] SEGMENT-SHIFT-DOWN: Given addresses a in MGAR₁, and b in MGAR₂ such that a<b, for 0≦j≦q do MG^(j) _(i)←MG^(j) _(i+1) and MGCH^(j) _(i+1)←MGCH^(j) _(i) for all a≦b.

[0194] A RANDOM-READ operation is performed on one copy of P*_(MG), i.e. MG⁰, TYPE⁰, and MG⁰. RANDOM-WRITE, PARALLEL-COMPOUND-SEARCH, BIPARTITION-SHIFTUP and SEGMENT-SHIFT-DOWN operations are performed on all copes of P*_(MG). For RANDOM-READ, RANDOM-WRITE, PARALLEL-COMPOUND-SEARCH, BIPARTITION-SHIFT-UP and SEGMENT-SHIFT-DOWN operations, each quadruple (MG_(i,1), MG_(i,2), TYPE_(i), MGCH_(i)) is treated as a superword. When a PARALLEL-COMPOUND-SEARCH operation is performed, the output of all P*_(MG) copies are the input of selectors. The output of one P*_(MG) copy is selected.

[0195] The CHANNEL-SCHEDULING method may be implemented in the multiple processor system as: method PARALLEL-CHANNEL-SCHEDULING begin success ← 0; for j = 0 to q do in parallel MGC^(j) ₁ ← T_(l)+L_(j) MGC^(j) ₂ ← T_(l)+L_(j)+ length(DB_(i)); endfor perform PARALLEL-COMPOUND-SEARCH; if MGAR₁ ≠ 0 then begin output MGCHR as the number of the channel for transmitting DB₂; output L, as the selected FDL delay time for DB_(i) k ≦ FD; for j = 0 to q do in parallel update MG^(j), 0 ≦j ≦q, using the values in MGCDR^(k) ₁ and MGDR^(k) ₂ endfor success ← 1; end if success = 0 then drop DB_(i) /* scheduling for DB_(l) is failed */ end

[0196] It may be desirable to be able to partition the r data channels into groups and choose a particular group to schedule DBs. Such situations may occur in several occasions. For example, one may want to test a particular channel. In such a situation, the channel to be tested by itself forms a channel group, and all other channels form another group. Then, channel scheduling is only performed on the 1-channel group. Another occasions is that during the operation of the router, some channels may fail to transmit DBs. Then, the channels of the same outbound link can be partitioned into two groups, the group that contains all failed channels, and the group that contains all normal channels, and only normal channels are to be selected for transmitting DBs. Partitioning data channels also allows channel reservation, which has applications in quality of services. Using reserved channel groups, virtual circuits and virtual networks can be constructed.

[0197] To incorporate group partition feature into channel scheduling associative processors, the basic idea is to associate a group identifier (or gid for short) with each channel. For a link, all the channels share the same gid belong to the same group. The gid of a channel is programmable; i.e. it can be changed dynamically according to need. The gid for a DB can be derived from its BHP and/or some other local information.

[0198] The design of P_(M) and P_(G) to P_(M-ext) and P_(G-ext) may be extended to incorporate multiple channel groups, as shown in FIGS. 25 and 26, respectively. As shown in FIG. 25, associative processor P_(M-ext) 290 includes M, MC, MCH, MAR₁, MAR₂, MDR, MCHR, as described in connection with FIG. 20. MCIDC 292 is a comparand register that holds the gid for comparison. MGID 294 is a memory of r words, MGID₁, MGID₂,. ., MGID_(r), with MGID₁ corresponding to M_(j) and MCH_(j). The words are connected as a linear array, and they are used to hold the channel group numbers. MGIDDR 296 is a data register.

[0199] P_(M-ext) is similar to P_(M) with the addition of several components, and modifying operations. The linear array MGID has r locations, MGID₁, MGID₂. . . , MGIDr; each is used to store an integer gid. MGID, is associated with M_(i) and MCH_(i), i.e. a triple (M_(i), MCH_(i), MGID_(i)) is treated as a superword. Comparand register MGIDC and data register MGIDDR are added.

[0200] Associative processor P_(M-ext) supports the following major operations that are used in the efficient implementation of the LAUC-VF channel scheduling operations.

[0201] RANDOM-READ: Given address x in MAR₁, do MDR←M_(x), MCH_(x)←MCHR and GIDR←MGID_(x).

[0202] RANDOM-WRITE: Given address x in MAR₁, do M_(x)←MDR, MCH_(x)←MCHR and MGID_(i)←MGIDDR.

[0203] PARALLEL-SEARCH1: Simultaneously, MGIDC is compared with the values of MGID₁, MGID₂, . . . , MCIDr). Find j such that MGID, =MGIDC, and do MAR₁←j, MDR₁←M_(j), MCHR←MCH_(j), and MGIDDR←MGID+j.

[0204] PARALLEL-SEARCH2: Simultaneously, (MC, MGIDC) is compared with (M₁, MGID₁), (M₂, MGID₂), . . . , (M_(r), MGID₂) Find the smallest j such that M_(j) <MC and MGID_(J)=GIDC, and do MAR₁←j, MDR₁←M_(j), MCHR←MCH_(j), and MGIDDR←MGID₁. If there does not exist any word (M_(j), MGID_(j)) such that M_(j) >MC and MGID_(j)=GIDC, MAR₁=0 after this operation.

[0205] SEGMENT-SHIFT-DOWN: Given addresses a in MAR₁, and b in MAR₂ such that a<b, perform M_(j), ←M_(j+1)←MCH_(j)<MCH_(j) and MGID_(j+1)←MGID_(j). for all a≦j<b.

[0206] For RANDOM-READ, RANDOM-WRITE and SEGMENT-SHIFT-DOWN operations, each triple (M_(j), MCH_(j), MGID_(j)) is treated as a superword. The output of PARALLEL-SEARCH1 consists r binary signals, MFLAG_(i), 1≦i≦r. MFLAG_(i)=1 if and only if MGID₁=MGIDC. There is a priority encoder with MFLAG_(i), 1≦i≦r, as input, and it produces an address j and this value is loaded into MAR₁ when PARALLEL-SEARCH1 operation is completed. The output of PARALLEL-SEARCH2 consists r binary signals, MFLAG_(i), 1≦i≦r. MFLAG_(i)=1 if and only if M_(i)≦MC and MGID₁=MGIDC. The same priority encoder used in PARALLEL-SEARCH 1 transforms MFLAG_(i), 1≦i≦r, into an address j and this value is loaded into MAR₁ when PARALLEL-SEARCH operation is completed. RANDOM-READ, RANDOM-WRITE, PARALLEL-SEARCH2 and SEGMENT-SHIFT-DOWN operations are used to maintain the non-increasing order of values stored in M.

[0207]FIG. 26 illustrates a block diagram of P_(G-ext). P_(G-ext) 300 includes G,GC,GCH,GAR,GDR,GCHR, as described in connection with FIG. 21. GGIDC 302 is a comparand register for holding the gid for comparision. GGID 304 is a memory of r words, GGID₁, GGID₂, . . . , GGID_(r), with GGID_(j) corresponding to G_(j) and GCH_(j). The words are connected as a linear array, and they are used to hold the channel group numbers. GGIDR 306 is a data register.

[0208] Similar to the architecture of P_(M-ext), a linear array GGID of n words, GGID₁, GGID₂, . . . , GGID, is added to P_(G). A quadruple (G_(i,1), G_(i,2), MCH_(i), GGID_(i)) is treated as a superword.

[0209] Associative processor P_(G-ext) supports the following major operations that are used in the efficient implementation of the LAUC-VF channel scheduling operations.

[0210] RANDOM-WRITE: Given address x in GAR, do G_(x,)←GDR₁, G_(x,2)←GDR₂, GCH_(x)←GCHR, GGID_(x)←GGIDR.

[0211] PARALLEL-DOUBLE-COMPARAND-SEARCH: The value of (GC, GGIDC) is compared with (G₁, GGID₁), (G2, GGID₂), . . . , (G_(n), GGID_(n)) simultaneously (in parallel). Find the smallest j such that G_(j,1)>GC₁, G_(j,2)>GC₂ and GGID_(j)=GGIDC. If this operation is successful, then do GDR₁←G_(j,1), GDR₂←G_(j,2), GCHR←GCH_(j), GGIDR←GGID_(j) and GAR←j; otherwise, GAR←0.

[0212] PARALLEL-SINGLE-COMPARAND-SEARCH: (GC₁, GGIDC) is compared with (G_(1,1), GGID₁), (G_(2,1), GGID₂), . . . , (G_(n,1), GGID_(n),) simultaneously (in parallel). Find the smallest j such that G_(j,1)>GC₁ and GGID_(j)=GGIDC. If this operation is successful, then do GDR₁←G_(j,1), GDR₂←G_(j,2), GCHR←GCH_(j), GGIDR←GGID, and GAR←j; otherwise, GAR←0.

[0213] BIPARTITION-SHIFT-UP: Given address a in GAR, shift the content of G_(j+1) to G_(j)←G_(j+1), GCH_(j)←GCH_(j+1), GCH_(j) to GCH_(j+1), GGID, to GGID_(j+1), for a≦j≦n, and G_(n,1)←0, G_(n,2)←0.

[0214] BIPARTITION-SHIFT-DOWN: Given address a in GAR, do G_(j+1)←G_(j), GCH_(j+1)←CCH_(j), GGID_(j+1) ←GCID_(j), a≦j≦n.

[0215] A quadruple (G_(i,1), G_(i,2), GCH_(i), GGID_(i)) corresponds to a gap with beginning time GC_(i,1) and ending time G_(i,2) on channel CCH_(i), whose gid is in GGID_(i). For RANDOM-WRITE, PARALLEL-DOUBLE-COMPARAND-SEARCH, PARALLEL-SINGLE-COMPARAND-SEARCH, BIPARTITION-SHIFT-UP, and BIPARTITION-SHIFT-DOWN operations, each quadruple (GCr₁, GCr₂, GCHZ, GGID₁) is treated as a super-word. The output of PARALLEL-DOUBLE-COMPARAND-SEARCH (resp. PARALLEL-SINGLECOMPARAND-SEARCH) operation consists n binary signals, GFLAG_(i), 1≦i≦n, such that GFLAG_(i)=1 if and only if G_(i,1)≧GC₁ and G_(i,2)≦GC₂ (resp. G_(i,1)≧GC₁), GGID_(i)=GGIDC. There is a priority encoder with GFLAG_(i), 1≦i≦n, as input, and it produces an address j and this value is loaded into GAR, when the operation is completed. RANDOM-WRITE, PARALLEL-SINGLE-COMPARAND-SEARCH, BIPARTITION-SHIFT-UP, and BIPARTITION-SHIFT-DOWN operations to maintain the non-increasing order of values stored in G_(i,1)s.

[0216] Changing the gid of a channel Ch_(j) from g₁ to g₂ is done as follows: find the triple (M_(i), MCH_(i), MGID_(i)) such that MCH_(i), =j and store i into MARI and (MDR, MCHR, MGIDDR); MGIDDR←g₂, and write back (MDR, MCHR, MGIDDR) using the address i in MARI.

[0217] Given a DB′, t_(s) ^(out)(DB), t_(e) ^(out)(DB′), and a gid g, the scheduling of DB′involves searches in P_(M-ext) and P_(G-ext). Searching in P_(M-ext) is done as follows: find the smallest i such that M₁<t_(s) ^(out)(DB) and MGID_(i)=g. Searching in P_(G-ext) is done as follows: find the smallest i such that G_(i,1)<t_(s) ^(out) (DB′), G_(i,2)>t_(s) ^(out) (DB′), and MGID₁=g.

[0218] Similarly, associative processors P_(G-ext) and P_(G*-ext) can be constructed by adding a gid comparand register. MGGIDC, a memory MGGID of m words MGGID₁, MGGID₂, MGGIDm, and a data register MGGIDDR. P_(MG-ext) is a combination of P_(M-ext) and PG-ext. The operations of P_(MG-ext) can be easily derived from the operations of P_(M-ext) and P_(G-ext) since the P_(M-ext) items and the P_(G-ext) items are separated. In P*_(MG-ext), the P_(M-ext) items and the P_(G) ext items are mixed. Since the MG_(i,1) values of these items are in non-decreasing order, finding the P_(M-ext) item corresponding channel Ch₁ can be carried out by finding the smallest j such that MGGID_(j)=i.

[0219] Although the Detailed Description of the invention has been directed to certain exemplary embodiments, various modifications of these embodiments, as well as alternative embodiments, will be suggested to those skilled in the art. The invention encompasses any modifications or alternative embodiments that fall within the scope of the claims. 

1. Circuitry for scheduling data bursts in a optical burst-switched router, comprising: an optical switch for routing optical information from an incoming optical transmission medium to one of a plurality of outgoing optical transmission media; a delay buffer coupled to the optical switch for providing n different delays for delaying information between the incoming transmission medium and the outgoing transmission media; scheduling circuitry associated with each outgoing medium, comprising n+1 associative processors, each associative processor including circuitry for: storing scheduling information for the associated outgoing optical transmission medium relative to a respective one of the n delays and for a zero delay, and identifying available time periods relative to the respective delays in which a data burst may be scheduled.
 2. The circuitry of claim 1 wherein the incoming optical transmission medium and the outgoing optical transmission media comprise optical fibers.
 3. The circuitry of claim 1 wherein the associative processors identify unscheduled time periods.
 4. The circuitry of claim 1 wherein the associative processors identify gaps between scheduled data bursts.
 5. The circuitry of claim 4 and further comprising a second set of n+1 associative processors, wherein the second set of associative processors identify unscheduled time periods.
 6. The circuitry of claim 1 wherein said delay buffer comprises discrete delay lines each coupled a predetermined input and a predetermined output of said optical switch.
 7. The circuitry of claim 1 wherein said delay buffer comprises a matrix of delay lines, where a desired delay line can be coupled between a selected input and selected output of said optical switch.
 8. A method of scheduling data bursts in a optical burst-switched router that routes optical information through an optical switch from an incoming optical transmission medium to one of a plurality of outgoing optical transmission media either directly through the optical switch or via one of n different delays of a delay buffer, comprising the steps of; storing scheduling information in n+1 associative processors for the associated outgoing optical transmission medium relative to a respective one of the n delays and for a zero delay, and concurrently identifying available time periods in each of said associative processors in which a data burst may be scheduled, such that available time periods associated with multiple delays can be simultaneously determined.
 9. The method of claim 1 wherein the incoming optical transmission medium and the outgoing optical transmission media comprise optical fibers.
 10. The method of claim 1 wherein said concurrently identifying step comprises the step of concurrently identifying unscheduled time periods in each of said associative processors.
 11. The method of claim 1 wherein said concurrently identifying step comprises the step of concurrently identifying gaps between data bursts in each of said associative processors.
 12. The method of claim 11 wherein said concurrently identifying step further comprises the step of concurrently identifying unscheduled time periods in each of said associative processors.
 13. The method of claim 1 wherein said delay buffer comprises discrete delay lines each coupled a predetermined input and a predetermined output of said optical switch.
 14. The method of claim 1 wherein said delay buffer comprises a matrix of delay lines, where a desired delay line can be coupled between a selected input and selected output of said optical switch. 